Coherent feedback that beats all measurement-based feedback protocols 
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We show that for a widely applicable minimal-time control problem, a coherent feedback proto- 
col is optimal, and is faster than all measurement-based feedback protocols, where the latter are 
defined in a strict sense. Our analysis reveals why coherent feedback can be expected to be supe- 
rior to measurement-feedback for a broad range of control problems: the optimal protocol exploits 
a geodesic in Hilbert space, a path that measurement protocols cannot follow. Our results also 
provide a guide to the design of coherent feedback protocols. 
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Feedback control, useful in reducing the effects of noise 
on quantum dynamical systems pQ, can be implemented 
in one of two ways: repeatedly measure the system and 
apply operations conditional on the measurement re- 
sults [H [3], or couple the system to another quantum 
system in such a way that a similar flow of information 
occurs between the two, but without explicit measure- 
ments j3HZ]- This second form of feedback is referred to 
as coherent feedback. Our motivation here is that numer- 
ical optimization performed on linear feedback networks 
by Nurdin, James, and Petersen [6] showed that coherent 
feedback can outperform feedback mediated by measure- 
ments, but the reason for this intriguing difference has 
remained largely a mystery. Just recently it has been 
shown that the difference is more pronounced for cooling 
an oscillator, again in the linear setting [8]. The authors 
of [8] suggest that passing the noise signal though the 
auxiliary system coherently allows it to cancel the quan- 
tum noise in a way that a measured signal cannot. But 
it is unclear how to design protocols to exploit the co- 
herent advantage, and a state-space explanation for this 
advantage is yet missing. Understanding why the evolu- 
tion accessible by coherent feedback is superior to that of 
measurement-based feedback should provide significant 
insight into the design of control protocols. 

Here we present a clear state-space explanation for 
the superior performance of coherent feedback, and show 
how it informs the design of coherent feedback proto- 
cols. To do this we consider the following simple con- 
trol task: transforming an arbitrary mixed state of an 
TV-dimensional system to a specified pure state in the 
minimal time. We will show that coherent feedback 
can perform this faster than measurement-based feed- 
back defined in a strict sense, with corresponding impli- 
cations for measurement-feedback defined in the broadest 
sense. Because this simple control task involves extract- 
ing entropy from a system (reducing it from a mixture 
to a pure state) , our result shows that coherent feedback 
can extract entropy faster than measurement-feedback. 
This has profound consequences for all feedback proto- 
cols whose purpose is to reduce the effects of noise: since 



noise continually injects entropy into a system, the faster 
a protocol can extract this entropy the better it will per- 
form, other things being equal. 

To begin, we note that any control problem is non- 
trivial only under a constraint on the dynamics. Here we 
will allow any joint unitary evolution to be applied to the 
system and any auxiliary system, under the minimal con- 
straint that the speed of this dynamics is bounded. Since 
measurements are mediated by unitary interactions be- 
tween the system and an auxiliary, we can apply this 
constraint to both kinds of feedback. Denoting the dif- 
ference between the minimum and maximum eigenvalues 
of the joint system/auxiliary Hamiltonian by A, our con- 
straint is A < 2/i. The constant p, may be thought of as 
the maximal speed of rotation that the Hamiltonian can 
generate in Hilbert space [§]. 

To proceed we must precisely define the two forms of 
feedback control, and there is a subtlety involved in this 
definition. A single "step" of a measurement-feedback 
protocol consists of making a measurement, and apply- 
ing a unitary operation to the system conditional on the 
measurement result. A protocol consists of a sequence of 
these steps. To evaluate the performance of a protocol 
whose purpose is to prepare a given pure state, one 
must average over all the possible measurement results 
and any other noise on the system. A useful measure of 
the performance is the probability that the system will 
be found in state \ip), and this is P = Tr[|-0)('0|p], where 
p denotes the averaged system density matrix. 

All quantum measurements are described by a set of 
measurement operators {A n : n — 1, . . . , A}, where n la- 
bels the measurement results [TO]. For a system initially 
in state p, upon obtaining result n the state becomes 
p n = AnpA^/TrlA^Anp], and the post-measurement 
state averaged over all the results is p = J2 n A n pA\. The 
polar decomposition theorem provides a natural way to 
define measurement-based feedback: each operator A n 
can be decomposed as A n = U n P n , where P n is positive 
and U n is unitary. Since the unitary operator U n pre- 
serves the entropy of the system, it is the set {P n } alone 
that describes the measurement. Upon obtaining result 
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n from the measurement {P n }, the feedback U n can be 
applied by the observer. Further, measurements whose 
operators are all positive (we will call these bare mea- 
surements) cannot decrease the von Neumann entropy of 
the system: for all bare measurements S(p) > S(p) [TT] . 
Thus under the above definition of measurement-based 
feedback, the feedback part of the process is required to 
prepare an initial mixed system in a pure state. 

Continuous measurements of Hcrmitian observables 
are bare measurements [H] , and thus feedback loops that 
use these measurements are measurement-based proto- 
cols in the above sense. Nevertheless there are contin- 
uous measurements that are not bare and that are rou- 
tinely implemented experimentally. An example is pho- 
ton counting or homodyne detection of the light output 
from an optical cavity [THj. The mere fact that light 
leaks out of the cavity reduces the cavity energy, and 
can reduce the entropy of the cavity mode without any 
feedback or even measurement. We do not wish to argue 
at this point as to whether of not feedback using homo- 
dyne detection should be regarded as pure measurement- 
feedback or not. Instead we will refer to feedback us- 
ing bare measurements as "strict-sense" measurement- 
feedback, and that mediated by more general measure- 
ments as "broad-sense" measurement-feedback. 

In fact, any measurement-feedback protocol can be im- 
plemented in a fully unitary way, in which case it be- 
comes a coherent feedback protocol. First, to implement 
a general M-outcome measurement one couples the sys- 
tem to an auxiliary system and performs a join unitary 
operation on them. One then makes a projection mea- 
surement on the auxiliary, projecting it onto one of a set 
of basis states that we will label |m) a . By selecting the 
appropriate joint unitary, one can implement any mea- 
surement: that is, the system will be taken to the state 
p m = A m pA\ n /Tr:[A] n A m p\ for each outcome m of the 
projection measurement for any chosen set of measure- 
ment operators A m . 

To implement a measurement-feedback process coher- 
ently, we note that in the above measurement procedure, 
after the joint unitary, within the subspaces defined by 
each of the states |m) a the system has the state p m . 
We can therefore implement the feedback without mak- 
ing the projective measurement, by instead applying the 
joint unitary 

^fb = ^c4 m) 0|m) a Ha. (1) 

I a 

This unitary applies to the subspace in which the 

system has state p m , as required. In this case the aver- 
age over all the measurement results is given by tracing 
over the auxiliary system. So long as the measure of 
performance involves only the average state, as does P, 
the above coherent feedback procedure reproduces the 
measurement-feedback. 



A coherent feedback protocol is any process in which 
the system is coupled to an auxiliary, a joint unitary 
is performed, and the auxiliary is traced over. A co- 
herent protocol can also involve the repeated use of 
this process. Every coherent feedback protocol can be 
described by a trace preserving operation of the form 
p — > p = Yl n AnpAh, for some set of measurement op- 
erators. So what is the difference between coherent and 
measurement-based feedback? The difference between 
the two, both for strict-sense and broad sense measure- 
ments, is that the latter have restrictions, and are thus 
a subset of the former. For measurement-feedback in 
the strict sense, each step of the coherent version must 
be divided into two joint-unitary operations, U\ and U2, 
performed in sequence. The restrictions are as follows: 
i) The unitary U\ cannot decrease the entropy of the 
system. That is S(p) = SiTr^wUl}) > S(p), where 
w = p ® a, p and a are respectively the initial states of 
the system and auxiliary, and Tr a [---] denotes the trace 
over the auxiliary, ii) The unitary U2 must have the form 
of Ufb in Eq.0. 

To define measurement feedback in the broader sense, 
we use the broadest possible definition: we place no re- 
striction on U\ save the constraint (s) imposed by the con- 
trol problem. Even in this case there remains a difference 
between coherent and measurement feedback. In a real 
example of the latter the feedback part, given by U2, must 
have an effect on the performance (or it would not be a 
feedback protocol at all). Thus measurement-feedback in 
the broadest sense is a feedback process in which [7 2 is 
necessary to obtain the performance of the protocol, and 
where U2 is restricted to the form in Eq. ([!]) . We will find 
that this restriction is significant. 

We first present the optimal coherent feedback pro- 
tocol for our control task. For this purpose we draw 
from the analysis of cooling in [14 . Denoting a basis 
for the system by we may write the initial sys- 

tem state as p = ^2 n p n \n)(n\, and that of the auxiliary 
as a = |0) a (0| a . We can prepare the system in the single 
state |0) by applying a Hamiltonian that transforms each 
of the joint states |n)|0) a respectively to |n)|0) a . In fact, 
for the initial state in question this transformation swaps 
the states of the two systems. Now, given the Hamilto- 
nian bound A < 2/i, the fastest way to transform one 
state to another is to follow the geodesic (the shortest 
distance in Hilbert space) [5]. A Hamiltonian that does 
this is 

Hr = M(|»>|0)a<0|<n| a + |0)|n) a <n|<0| a ) , (2) 

and it maximizes the constraint, achieving the transfor- 
mation in the shortest time, r opt = 7r/(2/i). We can 
perform all the transformations at the same time by us- 
ing the Hamiltonian H geo = J2 n -^n C °^ an d still satisfy 
A = 2p. Thus -ffgco is an optimal coherent protocol. 

As we have elucidated above, under the performance 
measure P, coherent feedback protocols subsume all 
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measurement-feedback protocols, thus the latter cannot 
do better than H gco . The key question is whether any 
measurement-feedback protocol can do as well as H geo . 
First let us consider measurement-feedback in the strict 
sense. Recall that the feedback operation is a set of 
unitary operations performed on the system alone, and 
therefore cannot reduce the system's entropy. As a re- 
sult, the measurement part of the protocol must reduce 
the entropy of the system to zero, which means that the 
projection measurement on the auxiliary must induce a 
projection measurement on the system. This will only 
happen if a set of orthogonal states of the auxiliary are 
perfectly correlated with a set of pure states of the sys- 
tem. To project the system onto all its basis states, this 
set of pure states must include a basis for the system. 
Now we need to examine how the joint unitary U± cre- 
ates the required correlation. Since the auxiliary starts 
in state |0) a , for each of the basis states of the system, 
U± must rotate the state of the auxiliary to a different or- 
thogonal state. In fact, the time taken to perform these 
rotations depends on the dimension of the system. As- 
sume the system has two states labelled |+) and |— ). 
Then conditional on the state |±) one can rotate |0) a by 
±45° to obtain the state |±) a = (|0) a ± |l) a ) /V2. The 
states |+) a and |— ) a are orthogonal, but since both ro- 
tation angles are only 45°, the required time is 7r/(4/^) 
rather than the 7r/(2/i) required to transform one state 
into another that is orthogonal to it. If we increase the 
size of the system, N, then the situation changes. The 
need to transform |0) a to a larger number of mutually or- 
thogonal states necessitates increasing the rotation angle, 
since the overlap of at least one of these orthogonal states 
with the initial state must be no greater than 1/viV. The 
minimum rotation angle is thus On = cos~ 1 (l/v]V), and 
the minimum rotation time is #at//z. 

Now we need to know the time required for the feed- 
back operation. Since the measurement cannot decrease 
the entropy of the system, it must produce the same num- 
ber of orthogonal output states as the size of the support 
of p, which we will call K. This can be seen by noting 
that p could have maximum entropy on its support. The 
final average state must also have at least this entropy, 
and so must contain at least K orthogonal states. The 
feedback process must rotate each of the K output states 
to the desired pure state, 1-0). By the same argument as 
that used above for the time taken by U\ , this requires a 
time 6k / Since K can be as large as N , there are initial 
states p for which the required time is We can con- 

clude that to prepare an ./V-dimensional system in a pure 
state, from an arbitrary initial state, using measurement- 
feedback in the strict sense, requires a minimum time of 
T mf = 2cos~ 1 (l/VN)/p. For a single qubit, r mf = r opt , 
but for N > 3, r m f > r opt . The ratio T m f/r opt increases 
monotonically with N, and as N —¥ oo, r m f = 2r op t. 

Now consider measurement-feedback in the broadest 
sense. In this case only the second part of the protocol 



- the feedback — is constrained. However, in a real 
measurement-based feedback protocol, the feedback part 
must have a non-trivial effect on the system. That is, 
the feedback part must be required to obtain the per- 
formance, or it would be discarded and the "protocol" 
would not longer be a feedback protocol, merely a mea- 
surement process. But the geodesic evolution is given by 
a Hamiltonian than cannot be written in the form re- 
quired of the measurement-based feedback Hamiltonian, 
Eq.|l]). Therefore, to the extent that a feedback proto- 
col uses measurement-feedback, to that extent it cannot 
follow the optimal path, and it therefore must be slower. 

The above results provide a guide to the design of 
coherent feedback protocols. Every interaction of the 
form H m = XY, where X and Y are Hermitian oper- 
ators for different systems, generates a unitary of the 
measurement- feedback form, Eq. 0. (Unitaries that im- 
plement bare measurements are also of this form, but 
with the system and auxiliary switched.) It is simple to 
show that every interaction of this form has eigenvectors 
that are unentangled product states of the two systems. 
On the contrary, the optimal coherent-feedback interac- 
tion Hamiltonian, Eq.Q, does not have the form of H m . 
It is instead a sum of terms of the form H c = AB^ + A^B, 
where both A and B are not Hermitian. This kind 
of interaction has matrix elements that simultaneously 
change the states of both systems, and in particular has 
eigenstates of that are maximally entangled states of the 
two systems. We therefore expect appropriately cho- 
sen interactions with terms of the form H c to be more 
efficient at extracting entropy than interactions of the 
form H m . In fact, the difference between the coherent 
and measurement-based feedback scenarios compared by 
Nurdin et al. [S] is exactly the replacement of interactions 
of the form H m with those of the form H c . 

We now test our expectation by comparing various 
continuous-time coherent feedback protocols in which the 
objective is to place a thermal mechanical resonator in 
its ground state. As "case 1" we take the scenario consid- 
ered by Hamerly and Mabuchi [8] , in which the resonator 
is coupled to a zero-temperature cavity mode (an auxil- 
iary resonator) by one-way (cascade) fields. The position 
of the resonator is coupled to an output field, and this 
output is fed into the auxiliary. The feedback is obtained 
by coupling the output of the auxiliary to the position of 
the resonator, so that the auxiliary applies a force to the 
resonator. The result of this is an effective coupling of the 
form xp a between the two oscillators, where x = (b + b^) 
is the mechanical position, and p a = —i(a — a}) is the 
auxiliary momentum. The quantum Langevin equations 
of motion for the coupled systems are [8] 

a = -[iA + r»/2]a + Xx + EiV^i<(t) (3) 

b = - [i A + 7 /2] b - iX Pa + ^b in (t). (4) 

Here A is the effective coupling rate between the two os- 
cillators, and A = y/kTi, where k is the output coupling 
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FIG. 1. The minimal value of the mechanical steady-state av- 
erage phonon number is plotted for various coherent feedback 
protocols, as a function of the effective interaction rate, A. 
See text for the definitions of the various cases. Light dashed 
line: case 1, A = (fails to cool). Light solid line: case 1, 
A = 10A; (achieves cooling for A < A). Dark dashed line: 
case 2 (achieves cooling with no lower bound). Dark solid 
line: case 3 (achieves the best cooling). 



achieved, but we can cool as cold as we like merely by 
increasing A. We further note that the interaction H^ is 
obtained from Hi merely by making the rotating-wave 
approximation 116] . This means that setting A > A in 
case 1 will effectively transform Hi into H^. So returning 
to case 1 we choose A = 10. We find as expected that 
for small A we obtain the performance of case 2, and as A 
increases this performance degrades. Finally we simulate 
case 3, and as expected we find that the performance is 
consistently better than case 2. The steady-state values 
of (n) for all cases are shown in Fig. [TJ 

Acknowledgements: The authors are partially sup- 
ported by the NSF projects PHY-1005571, PHY- 
0902906, and PHY-1212413, and by the ARO MURI 
grant W911NF-11-1-0268. 



rate for x, and Ti is one of two damping rates for the 
auxiliary, the other one being T2- The damping rate 
of the mechanical resonator is 7, and A is the effective 
frequency of both oscillators. The latter is adjustable 
by detuning the cavity input field, modulating the in- 
teraction with the position, and working in a rotating 
frame [IS])- The input quantum noise sources a™(t) are 
at zero temperature, so that (a\ n (t)a^ (<')) = 5(t' — t), 
while (b m {t),b l ^(t')) = (n T + l)5(t' -t), where n T is the 
average number of thermal phonons in the resonator. 

As our second case we take the same system, but re- 
place the interaction Hi — Xxp a = —iA(b + b^)(a — a^) 
with H2 — i\{ba) — b^a), an interaction similar to H geo . 
Recall that since the control is steady-state, entropy ex- 
tracted from the system must be continually dumped out 
of the auxiliary, and this is achieved by the damping rates 
Fj. However, this damping is equivalent to a continuous 
measurement process, and we can expect this to interfere 
with the near-geodesic interaction. We therefore consider 
a third case, in which the auxiliary is not damped at all. 
In this case we replace the auxiliary with a fresh auxiliary 
after a time r ~ ir/ (2A) — the time it takes to complete 
a single state-swap — and repeat this process. 

To compare the above cases we measure time in units 
of 1/fc, and set the parameters of the mechanical res- 
onator to be ut = 10 and 7 — O.Olfc. We then mini- 
mize the mechanical steady-state average phonon num- 
ber, (n) = (b^b), over the control parameters Ti and 
T2, for each value of A. Choosing A = we find that 
no cooling is possible: while cooling is in theory possible 
with this configuration, the simple auxiliary does not pro- 
cess the position information in the right way to achieve 
it. Switching to case 2, we find that not only is cooling 
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